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Abstract 

In an implicit combinatorial optimization problem, the constraints are 
not enumerated explicitly but rather stated implicitly through equations, 
other constraints or auxiliary algorithms. An important subclass of such 
problems is the implicit set cover (or, equivalently, hitting set) problem 
in which the sets are not given explicitly but rather defined implicitly 
For example, the well-known minimum feedback arc set problem is such 
a problem. In this paper, we consider such a cover problem that arises 
in the study of wild populations in biology in which the sets are defined 
implicitly via the Mendelian constraints and prove approximability results 
for this problem. 

Keywords: Implicit set cover, Computational Biology, Inapproximbility 

1 Introduction 

In an implicit combinatorial optimization problem, the constraints are not enu- 
merated explicitly but rather stated implicitly through equations, other con- 
straints or auxiliary algorithms. Well-known examples of such optimization 
problems include convex optimization problems where the constraints are not 
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given explicitly but rather can be queried implicitly through a separation ora- 
cle or given by an auxiliary algorithm. For example, the ellipsoid method can 
be used to solve in polynomial time a linear programming problem with possi- 
bly exponentially many constraints provided we have a separation oracle that, 
given a tentative solution, in polynomial time either verifies that the solution is 
a feasible solution or provides a hyperplane separating the solution point from 
the feasible region. This paper concerns the implicit set cover problems which 
are defined as follows. In the standard (unweighted) version of the set cover 
problem, we are given a collection of subsets S over an universe of elements U 
and the goal is to find a sub-collection of sets from S of minimum cardinality 
such that the union of these sets is precisely U. A combinatorially equivalent 
version of the set cover problem is the so-called hitting set problem where one 
needs to pick instead a subset of the universe U of minimum cardinality which 
contains at least one element from every set. Set cover and hitting set problems 
are fundamental problems in combinatorial optimization whose computational 
complexities have been throughly investigated and well understood [11]. More 
general version of the problem could include generalizing the objective function 
to be minimized, namely the number of sets picked, by say having weighted 
sets and minimizing the sum of weights of the selected sets, or by defining a 
monotone objective function on the set system. 

Implicit set cover (or hitting set) problems have the same standard setting, 
but the sets are not given explicitly hut rather implicitly through some im- 
plicit combinatorial constraints. For example, the minimum feedback vertex 
set or the minimum feedback arc set problems are examples of such implicit 
hitting set problems. Such implicit set cover or hitting set problems can be 
characterized by not giving the collection of sets S explicitly but via an efficient 
(polynomial-time) oracle O that will supply members of <S satisfying certain 
conditions. For example, the recent work of Richard Karp and Erick Moreno 
Centeno^ considers some implicit hitting set problems with applications to mul- 
tiple genome alignments in computational biology in which the oracle O provides 
a minimum-cardinality set (or a good approximation to it) from the collection 
S that is disjoint from a given set Q. In addition to standard polynomial-time 
approximation guarantees, one could also invoke other measures of efficiencies, 
such as number of access to the oracle O to obtain an optimal or near-optimal 
solution to the hitting set problem as used by Karp and Centeno. 

In this paper, we consider an implicit (unweighted) set cover problem, which 
we call the MIN-PARENT problem, that arises in the study of wild population. 
Our problem in the setting described above is roughly as follows. Our oracle O 
returns, given a sub-collection of elements tC C U, if there is a set that includes 
W. Our specific objective function is motivated by the biological application 
and is a monotone function, namely including a new element in our collection 
does not decrease it. More precise formulations of our problems appear in the 
next section and will easily convince the reader that our problem is not captured 
by previous works or the recent work by Karp and Centeno. 

^Richard Karp, UC Berkeley, personal communication. 
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2 Motivations 



For wild populations, the growing development and application of molecular 
markers provides new possibilities for the investigation of many fundamental 
biological phenomena, including mating systems, selection and adaptation, kin 
selection, and dispersal patterns. In our motivation we are concerned with full 
sibling relationships from single generation sample of microsatellite markers. 
Several methods for sibling reconstruction from microsatellite data have been 
proposed (e.g., see [6, 10, 12]). Combinatorial approaches to sibling reconstruc- 
tion using suitable parsimony assumptions have been stiiclied in [2 5]. These 
approaches use the Mendelian inheritance rules to impose constraints on the 
genetic content possibilities of a sibling group. A formulation of the inferred 
combinatorial constraints in constructing a collection of groups of individuals 
that satisfy these constraints under the parsimony assumption of a minimum 
number of parents leads to the MIN- PARENT problems discussed in the paper. 

3 Precise Formulations of MIN-PARENT Prob- 
lems 

An element (individuat) u is an ordered sequence {ui,U2, ■ ■ ■ ,ue) where each 
Uj is a genetic trait {locus) and is represented by a multi-set {ujfl,Uj^i\ of 
two (possibly equal) numbers {alleles) inherited from its parents. Biologically, 
each element corresponds to an individual in the sample of the wild population 
from the same generation. We have a universe hi consisting of n such elements. 
Certain sets of individuals in U can be full siblings, i.e. having the same pair 
of parents under the Mendelian inheritance rule. These sets are specified in 
an implicit m.anner in the following way. The Mendelian inheritance rule states 
that an individual u = {ui, U2, ■ . ■ , Ui) G U can be a child of a pair of individuals 
{parents), say v = {vi, V2, ■ ■ ■ , vi) and w = {wi, W2, . . . , uii), if and only if for 
each locus j G {!....,(.} one allele of Uj is from Vj and the other element of 
Uj is from Wj. Finally, a subset U' Qhi is a, {full) sibling group if and only if 
there exists a pair of parents v and w such that every member of W is a child 
of V and w. Note that any pair of individuals is a full sibling groiip by the 
Mendelian constraints. As an illustration, the four individuals (with £ = 2 loci) 
({1, 2}, {1, 1}), ({4, 3}, {6, 6}) and ({1, 2}, {1, 6}) form a full sibling group since 
they can be the children of the two parents ({1, 3}, {1, 6}) and ({2, 4}, {1, 6}). 

Given these Mendelian constraints, our goal is to cover the xmiYcrselA by a set 
of full-sibling groups under the parsimonious assumption of a minimum number 
of parents. Formally, the MIN-PARENT„^^ problem is defined as follows. 

Problem name: MIN-PARENT„,^ 

Input: Our input is an universe U oin individuals each with (. loci. 

Valid Solutions: a cover A oiU such that each set S & A in the cover is a 
sibling group. 
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Notation: B{A) denote a set of individuals (parents) such that every set S 
(sibhng group) in the cover has its two parents from B{A). 

Objective for minimization: minimize 1-6(^^)1 = min^ l'8(.4)| 

In the setting of the implicit set cover problems described before, our cover 

problem is as follows: 

• Our sets (sibling groups) are defined implicitly by the Mendelian con- 
straints; note that the number of such sets is possibly exponential and 
thus we cannot always enumerate them in polynomial time. 

• Our polynomial time oracle O answers queries of the following type: given 
a given subset W C U of the universe, does W form a valid (sibling) set 
following the Mendelian constraints^? It is easy to show a polynomial- time 
implementation of the oracle {e.g., see [4]). 

Finally, note that our objective function is obviously monotone since W G U 
implies \B{W)\ < \B{1{)\. A natural parameter of interest in covering prob- 
lems the maximum size (number of elements) a in any set. For our problem, 
the parameter a corresponds to maximum number of individuals of any sibling 
group. 

We first show that the MIN-PARENT problem is MAX-SNP-hard even if 
a = 3. This leads us to the question about the computational complexity of 
the problem for arbitrary a. We will show that, for arbitrary a, it is very hard 
to even find an approximation to a minimum set of parents for a given sibling 
partition of the universe with given a candidate set of parents that includes 
an optimal set of parents. Formally, the FIND-MIN-PARENT„_^ is defined as 
follows. 

Problem name: FIND-MIN-PARENT„,i^. 

Input: a partition ^ of a set U of n elements, each with i loci, such that each 
set S in the partition >1 is a sibling set, and a set of elements (possible 
parents) V. 

Valid Solutions: any B{A) provided that B{A) C V. 

Notation: B{A) denote a set of individuals (parents) such that every set S 
(sibling group) in the cover has its two parents from B{A). 

Objective for minimization: minimize \B{U)\ = ming(_4)c-p |'S(-^)|- 

^Notc that if W is not a valid set, the oracle O does not provide any hint about other 
possible valid sets. 
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3.1 Standaird Terminologies 

Recall that a (1 + e)- approximate solution (or simply an (1 + £)-approximation) 
of a minimization (resp. maximization) problem is a solution with an objective 
value no larger (resp. no smaller) than 1 + s times (resp. (1 + s)~^ times) the 
value of the optimum, and an algorithm achieving such a solution is said to have 
an approximation ratio of at most l + £. A problem is r-inapproximable under a 
certain complexity-theoretic assumption means that the problem does not have 
a r- approximation unless the complexity-theoretic assumption is false. 

L-reductions are a special kind of approximation-preserving reduction that 
can be used to show MAX-SNP-hardness of an optimization problem. Given two 
optimization problems 11 and H', 11 L-reduces to H' if there arc three polynomial- 
time procedures Ti,T2, T3 and two constants a and 6 > such that the following 
two conditions are satisfied: 

(1) For any instance / of H, algorithm Ti produces an instance /' = /(/) of 11' 

generated from Ti such that the optima of / and /', OPT {I) and OPT {I'), 
respectively, satisfy OPT {I') < a ■ OPT {I). 

(2) For any solution of /' with cost c', algorithm T2 produces another solution 

with cost c" that is no worse than c', and algorithm T3 produces a solution 
of / of n with cost c (possibly from the solution produced by T2) satisfying 
|c - OPT{I)\ < b ■ \c" - OPT{I')\. 

An optimization problem is MAX-SNP-hard if another MAX-SNP-hard prob- 
lem L-reduces to that problem. Arora et al. [1] show that, assuming P^NP, ev- 
ery MAX-SNP-hard problem is (1 -|- £)-inapproximable for some constant £ > 
unless P=NP. 

3.2 Our Results 

For MIN-PARENT„,i?, we show in Section 4 that the problem is MAX-SNP- 
hard even if a = 3 and observe in Section 5 that for any a and any integer con- 
stant c > the problem admits an easy -I- Inc) -y/n-approximation. We show 
in Section 6 that, for arbitrary a, FIND-MIN-PARENT„,<; admits no 2^°^' 
approximation, for every constant < £ < 1, unless NPCDTIME(nP''''''°s(")). 

4 Inapproximability of MIN-PARENT for a = 3 

Lemma 1 MIN-PARENT^,/. is MAX-SNP-hard even if a = 3. 

Proof. For notational simplification, when an individual has the multiset 
{x, x} in a locus, we will refer to it by saying that the individual has a "label" 
of value x in that locus. Our construction will ensure that all individuals have 
only one label at every locus. It is then easy to check that a set of individuals 
can be a sibling set if and only if at each locus they have labels with no more 
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than two distinct values. In the sequel, we will use the terminologies "label x" 
and "locus {x, x}" interchangeably. 

The (vertex-disjoint) triangle-packing (TP) problem is defined as follows. 
Wc arc given an undirected connected graph G. A triangle is a cycle of 3 nodes. 
The goal is to find (pack) a maximum number of node- disjoint triangles in G. 
TP is known to be MAX-SNP-hard even if every vertex of G has degree at most 
4 [7]. Moreover, the proof in [7] show that the MAX-SNP-hard instances of TP 
in their reduction produces an instance of TP with n nodes in which an optimal 
solution has an triangles for some constant < a < 1. 

Wc will provide an approximation preserving reduction from an instance 
graph G of n nodes of TP with nodes of G having a maximum degree of 4 as ob- 
tained in [7] to MIN-PARENT„^^. We introduce an individual u for every node 
u of the graph G and provide ordered label sequences for each node (individual) 
such that: 

(1) Three individuals corresponding to a triangle of G have at most two values 

in every locus and thus can be a sibling set. 

(2) Three individuals that do not correspond to a triangle of G have at least 

three values in some locus and thus cannot be a sibling set. 

(3) Consider any maximal set of vertex disjoint triangles in G and the corre- 

sponding sibling sets (each of size 3). Partition the remaining vertices of 
G not covered by these triangles arbitrarily into pairs (groups of size 2) 
and consider the corresponding full sibling sets (each of size 2). Then, 
each sibling set in the above collection requires two new parents. 

Note that since we have a maximal set of triangles, no three vertices in 
the set of pairs can form a triangle. Conversely, given any solution of 
MIN-PARENT„_^, we preprocess the solution to get a canonic;al solution 
to ensure that no three individuals in the union of pairs can be a sibling 
set; this preprocessing does not increase the number of sibling sets. 

Note that, since any pair of individuals can be a full sibling set, the above 
properties imply that TP has a solution with t triangles if and only if the MIN- 
PARENT problem can be solved with 2t + 2- = n-t parents. 

The MAX-SNP-hardness now follows easily since an optimum solution of TP 
on G has an triangles for some constant < a < 1. More precisely, let / and 
/' be the instance of TP and the corresponding instance of MIN-PARENT„.£, 
respectively, and let OPT(/) and OPT(/') denote the number of triangles and 
the number of parents in an optimal solution of / and /', respectively. Then, 
the following two statements hold. 

(a) Since OPT(/) an we have OPT(J') =n-an= ^an = (i^) 0PT(7) 

where is a positive constant. 

(b) Since OPT(/') = n — anwe must have c' > n — an. Thus, if c' = n — an + x 

(for some x) is the number of parents in a solution of the instance /' after 
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preprocessing then number of triangles in the sohition of the instance / of 
TP is given by c = n-c' = an-x and thus |c - OPT{I) \ = \c' - OPT(J')|. 

Now, we describe the reduction. 

Our first set of loci are as follows. The index of a locus, which we call the 
"coordinate", is defined by an "origin" node u. Thus, we will have \ V\ such loci, 
one for every node u. The respective label of an individual v at this coordinate 
is the distance (number of edges in a shortest path) from u to v. 

Our second set of loci arc as follows. We have such a locus for every set 
of three vertices that does not form a triangle. Thus, we will have 

0(|yp) such loci. Since the three vertices do not form a triangle, at least one 
pair of them, say u and v, arc not connected by an edge. As a result, the set 
of vertices {u, v, x} do not form a triangle for any other vertex x ^ {u, v}. Our 
goal is to ensure that the vertices u, v and w cannot be a sibling group while 
not disallowing any other sibling groups that can be formed by a triangle in the 
graph. This is easy to do. Put the label 1 in this locus for the individual u, 
label 2 for individual v and label 3 for all other individuals. 

First we need to check that Property (1) holds. The following is true with 
respect to the first set of loci. Consider a triangle {u,v,w}, any locus (coordi- 
nate) £ and assume that u has the minimum label value of L, i.e., it is nearest 
to the origin node that defined £. Then labels of v and w arc at least L and at 
most L + 1, hence u, v and w have at most two labels at i. The second set of 
loci never disallows a sibling group corresponding to a triangle, so the property 
is not violated by them cither. 

The construction of the second set of loci implies that Property (2) is true. 

Finally, we need to verify Property (3). There are three cases to verify. 

First, consider the case when we have two sibling groups correspond to two 
triangles Ti = {ujVjw} and T2 = {p,q,r} in G. Note that since nodes in G 
have a maximum degree of 4, any node of one triangle can be connected to at 
most two nodes in the other triangle. 

The locus i defined by the origin node u has a label for u and a label 1 
for V and w. Thus, the sibling set {u, v, w} can be generated only by a pair of 
parents, say A and B, each of which has the alleles {0, 1} in locus £. 

Since u is connected to at most two nodes in T2, it is not connected to a 
node in T2, say r. Then, r must have a label a; > 2 in locus £. Thus, neither A 
nor B can be a parent of the sibling group {p, q, r} since x ^ {0, 1}. 

Second, consider the case when the we have two sibling groups corresponding 
to a triangle T = {u,v,w} and a pair P = {p,q}. Consider the locus defined 
by the origin node u. We have a label for u and a label 1 for v and w in this 
locus. Thus, the sibling set {u, v, w} can be generated only by a pair of parents, 
say A and B, each of which has the alleles {0, 1} in this locus. If node u is not 
connected to both nodes p and q then one of the nodes which is not connected 
to u, say p, must have a label x > 2 in this locus. Thus, neither A nor B can 
be a parent of the sibling group {p, q} since x ^ {0, 1}. Otherwise, it must be 
the case that u is connected to both p and q. 
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Repeating the same argument with q as the origin node and then r as the 
origin node shows that the only case that remains to be considered is when each 
of w, V and w is connected to both the nodes p and q. But, then the induced 
subgraph of G with vertices ?i, v, w, p and g is a 5-chque. Since every node 
in G has a degree of no more than 4, this imphes that G has more than one 
connected component, contradicting the fact that G was a connected graph. 

Finally, consider the case when we have two sibling groups corresponding 
to two pairs Pi = {u,v} and P2 = {p,q}- Since we have preprocessed the 
solution of MIN-PARENT„^^ or equivalently have a maximal set of triangles for 
the solution of TP, node u is not connected to at least one node in P2, say p. 
The locus defined by the origin node u has a label for u and a label 1 for v, 
but has a label a; > 2 for p. Thus, the sibling set {u, v} can be generated only 
by a pair of parents, say A and B, each of which has the alleles {0, 1} in the 
corresponding locus, but neither A nor B can be a parent of the sibling group 
{p, q} since x ^ {0, 1}. □ 

5 A Simple Approximation Algorithm for MIN- 
PARENT„^ 

Note that we do not need to know the value of a in the theorem below. 

Observation 2 Let a be the maximum size of any sibling set. Then, for any in- 
teger constant c > 0, MIN-PARENT admits an easy ^ — hlnc^ ^/n- approximation 
with polynomially many access to the oracle O (and, thus in polynomial time). 

Proof. Our proof is similar to the analysis of a standard greedy algorithm for 
set cover problems [11]. 

Suppose that we have a subset W CU oi the universe that is still not covered. 
We can enumerate all subsets of W of size at most c in 0{rf) time and for each 
subset query the oracle O to find if any of these subsets of individuals are full 
siblings for the MIN-PARENT„^^ problem. Thus we can assume that for every 
instance of the problem, either the maximum sibling set size is below c and we 
can find such a group of maximum size, or we can find a sibling set of size c. 
Our algorithm simply selects such a set, removes the corresponding elements 
from W and continues until all elements of U are covered. 

Obviously, all subsets of a sibling set are valid sibling sets too. Let OPT 
be the minimum number of parents in an optimal solution of MIN-PARENT„^£. 
Consider an optimum solution, make it disjoint by arbitrarily shrinking each full- 
sibling set and let a be the number of sets in this partition. Obviously, a <n/2. 
Since no two full-sibling sets are produced by the same pair of parents (because 

of minimality), (^^'^) > a which implies OPT> \/2a. We distribute the cost 
of our solution among the sets of the optimum. When a set with b elements 
is selected, we remove each of its element and charge the sets of the optimum 
1/6 for each removal. It is easy to see that a set with a elements will get the 
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sequence of charges with values at most (1/c, . . . , 1/c, l/(c — 1), l/(c — 2), . . . , 1) 



and these charges add to ^ — 1 + I = f + Z)i=2 I < f + In c. Thus, we use 
at most (- + Inc) a sibling groups. Each sibling group can be generated by at 

most two new parents. Thus, the total number of parents necessary to generated 
these sibling groups is at most + Inc) ^/2a OPT< (| + Inc) y^OPT. □ 



6 Inapproximability of FIND-MIN- PARENT 

Lemma 3 For every constant < e < 1, FIND-MIN-PARENTn^i admits no 

2iog= n _ approximation unless NPC D TIME{nP°'-y ) . 

Proof. We first need the MINREP problem which is defined as follows. We 
are given a bipartite graph G = {A,B,E). We are also given a partition of 
A into |A|/q! cqual-sizc subsets Ai,A2, . . . ,Aa and a partition of B into \B\//3 
equal-size subsets Bi, B2, ■ ■ ■ , Bfj. These partitions define a natural "bipartite 
super-graph" H in the following manner. H has a "super-vertex" for every 
Ai (the left partition) and a "super- vertex" for every Bj (the right partition). 
There exists an "super-edge" between the super- vertex Ai and the super- vertex 
Bj if and only if there exists u € A^ and v e Bj such that {u, u} is an edge of G. 
A pair of vertices u and v "witnesses" a super-edge {Ai^Bj} provided a E Ai, 
b G Bj and the edge {a, b} exists in G. A set of vertices 5* of G witnesses a 
super-edge if there exists at least one pair of vertices in S that witnesses the 
super-edge. The goal of the MINREP problem is to find A' C A and B' C B 
such that AU B witnesses every super-edge of H and the size of the solution, 
namely -|- is minimum. 

For notation simplicity, let n = \A\ + \B\. The following result is a conse- 
quence of Raz's parallel repetition theorem [8, 9]. Let L G NP and < £ < 1 be 
any fixed constant. Then, there exists a reduction running in quasi-polynomial 
time, namely in time 72^°'^'°^^"-', that given an instance a; of L produces an 
instance of MINREP such that ii x € L then MINREP has a solution of 
size at most at most a + (3, but if a; ^ L then MINREP has a solution of 
size at least {a + (3) ■ 2'°^ Thus, the above theorem shows that MIN- 
REP has no 2'°^ "-approximation under the complexity-theoretic assumption 
of NP2DTIME(nP°'y'°s(n))_ 

Let L be any language in NP. Use the above theorem to translate an instance 
X of L to an instance of MINREP as described above. Now, we describe a trans- 
lation of this instance of MINREP to an instance of FIND-MIN-PARENTp,„,^. 

We have a parent in V corresponding to every element v G A\J B. We 
have an individual Safi in U for every edge {a, &} in G. Thus, the number 
of possible parents in "P is n and the number of individuals in U is O(n^). 
It therefore suffices to prove a 2'°^^ I ''I -inapproximability since that implies as 
2iog° I'^l-inapproximability. 

Before describing our reduction, we need a generic construction of the fol- 
lowing nature to simplify our description. We are given two elements PuiPv ^'P 
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and an clement Sa^b G ^- We want to add a new locus with appropriate allele 
values to ensure that Sa,6 cannot be a child of Pu and p„ , but no other parent- 
child relationship is forbidden. This is easy to do. Put the alleles {a,b} in this 
locus for Pu and p-^ and put the alleles {a, c} in this locus for every individual 
(including Sa,b) in {PUU) \ {pu,Pv}- It follows that Sa,b cannot be a child of 
Pu and py since c {a,b}, but no other child-parent combination is forbidden 
since {a, c} can be produced by the Mendelian rule either from {a, b} and {o, c} 
or from {a, c} and {a, c}. 

Now, we add additional loci to the individuals in U L\V in the following 
manner following the two rules: 

Rule (★): For every edge {u, v} of G with u € Ai and v € Bj and for every pair 
of vertices {a, 6} such that {a, b} e £'\{ {y, z}\y € Ai, z G Bj, {y, z} £ E} 
we add an additional locus using the generic construction to ensure that 
So, 6 cannot be a child of p„ and p„. 

Rule {*rk): For every pair of vertices u and w of G such that {u, v} ^ E and for 
every pair of vertices a and 6 of G such that {a, b} e E, we add an addi- 
tional locus using the generic construction to ensure that the individual 
Sa,b € ^ cannot be a child of the parents p„ and py in V. 

We build each individual in U U V locus-by-locus in the above manner. Our 
partition A ofU to sibling groups is defined as follows: we have a sibling group 
Aij = {{sa^b} I {o,,b} witnesses the super-edge {Ai,Bj} } for every super-edge 
{Ai,B,}. 

First, we need to verify that each of our sibling set is indeed a sibling set. 
Consider the sibling set Aij. Pick any u Ai and v G Bj such that {u, v} € E, 
i.e., {u, v} witnesses the super-edge {Ai, Bj}. We claim that pu and Pv are the 
parents for all individuals in Aij. Indeed, the two rules allow this. 

Suppose that MINREP has a solution of size 7. This generates a set of 7 
parents for FIND-MIN- PARENT in an obvious manner: for every vertex v in 
the solution of MINREP we pick the individual Py in the solution of FIND- 
MIN-PARENT. If the super-edge {Ai,Bj} is witnessed by the edge {u,v} in 
the solution of MINREP, then the sibling set Aij is generated by the parents 
Pu and Py. 

Conversely, suppose that FIND-MIN-PARENT has a solution with 7 par- 
ents. We associate each parent Pu to the corresponding vertex u of G in our 
solution of MINREP. Consider a super-edge {Ai, Bj} and the associated sibling 
set Aij. Suppose that pu and pv are the parents of this group. By Rule (*★), 
{u,v} e E. By Rule (*), one of pu and Pv, say p„, must be from Ai and the 
other one py from Bj. Thus, the edge {u,v} witnesses this super-edge. □ 

Remark 1 The above reduction works even if one does not specify the set A of 
sibling partition explicitly as part of input but allows all feasible partitions. 

Acknowledgements. We thank Richard Karp for his talk at ICS-2009 that 
motivated us to think about our problem as an implicit cover problem. 
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